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Abstract 

Many formalisms combining ontology languages with uncer¬ 
tainty, usually in the form of probabilities, have been stud¬ 
ied over the years. Most of these formalisms, however, as¬ 
sume that the probabilistic structure of the knowledge re¬ 
mains static over time. We present a general approach for 
extending ontology languages to handle time-evolving un¬ 
certainty represented by a dynamic Bayesian network. We 
show how reasoning in the original language and dynamic 
Bayesian inferences can be exploited for effective reasoning 
in our framework. 


Introduction 

Description Logics (DLs) (Baader el al. 2007) are a well- 
known family of knowledge representation formalisms that 
have been successfully employed for encoding the knowl¬ 
edge of many application domains. In DLs, knowledge is 
represented through a finite set of axioms, usually called an 
ontology or knowledge base. In essence, these axioms are 
atomic pieces of knowledge that provide explicit informa¬ 
tion of the domain. When mixed together in an ontology, 
these axioms may imply some additional knowledge that is 
not explicitly encoded. Reasoning is the act of making this 
implicit knowledge explicit through an entailment relation. 

Some of the largest and best-maintained DL ontologies 
represent knowledge from the bio-medical domains. For 
instance, the NCBO BioportaQ contains 420 ontologies of 
various sizes. In the bio-medical fields it is very common 
to have only uncertain knowledge. The certainty that an ex¬ 
pert has on an atomic piece of knowledge may have arisen 
from a statistical test, or from possibly imprecise measure¬ 
ments, for example. It thus becomes relevant to extend DLs 
to represent and reason with uncertainty. 

The need for probabilistic extensions of DLs has been 
observed for over two decades already. To cover it, many 


different formalisms have been introduced (Jaeger 1994 
Lukasiewicz and Straccia 2008[ [Lutz and Schroder 2010 


Klinov and Parsia 2011 1 . The differences in these logics 


range from the underlying classical DL used, to the seman¬ 
tics, to the assumptions made on the probabilistic compo¬ 
nent. One of the main issues that these logics need to handle 
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is the representation of joint probabilities, in particular when 
the different axioms are not required to be probabilistically 
independent. A recent approach solves this issue by divid¬ 
ing the ontology into contexts , which intuitively represent 
axioms that must appear together. The probabilistic knowl¬ 
edge is expressed through a Bayesian network that encodes 
the joint probability distribution of these contexts. Although 
originally developed as an extension of the DL EC ( |Cey-| 
|lan and Penaloza 2014b] ), the framework has been extended 
to arbitrary ontology languages with a monotone entailment 
relation (Ceylan and Penaloza 20 14a) . 

One common feature of the probabilistic extensions of 
DLs existing in the literature is that they consider the prob¬ 
ability distribution to be static. For many applications, this 
assumption does not hold: the probability of a person to have 
gray hair increases as time passes, as does the probability of 
a computer component to fail. To the best of our knowledge, 
there is so far no extension of DLs that can handle evolving 
probabilities effectively. 


In this paper, we describe a general approach for extend¬ 
ing ontology languages to handle evolving probabilities. By 
extension, our method covers all DLs, but is not limited to 


them. The main idea is to adapt the formalism from (Cey- 
lan and Pefialoza 2014a|> to use dynamic Bayesian networks 


(DBNs) ( Murphy 2002) as the underlying uncertainty struc¬ 
ture to compactly encode evolving probability distributions. 


Given an arbitrary ontology language C, we define its dy¬ 
namic Bayesian extension VBC. We show that reasoning in 
VBC can be seamlessly divided into the probabilistic com¬ 
putation over the DBN, and the logical component with its 
underlying entailment relation. In order to reduce the num¬ 
ber of entailment checks, we compile a so-called context 
formula, which encodes all contexts in which a given conse¬ 
quence holds. 

Related to our work are relational BNs ( [Jaeger 1997| > and 
their extensions. In contrast to relational BNs, we pro¬ 
vide a tight coupling between the logical formalism and 
the DBN, which allows us to describe evolving probabilities 
while keeping the intuitive representations of each individ¬ 
ual component. Additionally, restricting the logical formal¬ 
ism to specific ontology languages provides an opportunity 
for finding effective reasoning algorithms. 


































Bayesian Ontology Languages 

To remain as general as possible, we do not fix a specific 
logic, but consider an arbitrary ontology language C consist¬ 
ing of two infinite sets 21 and £' of axioms and consequences, 
respectively, and a class D C pfj n (2l) of finite sets of ax¬ 
ioms, called ontologies, such that if O £ D, then O' G O for 
all O' C O. The language C is associated to a class 3 of in¬ 
terpretations and an entailment relation |= C 3x(2lU£). An 
interpretation I £ 3 is a model of the ontology O (I \= O) 
if X \= a for all a £ O. O entails c £ £ (O \= c) if 
every model of O entails c. Notice that the entailment rela¬ 
tion is monotonic; i.e., if O \= c and O C O' £_ D, then 


O' |= c. Any standard description logic (DL) (Baader et al. 
|2007j ) is an ontology language of this kind; consequences 
in these languages are e.g. concept unsatisfiability, concept 
subsumption, or query entailment. However, many other 
ontology languages of varying expressivity and complexity 
properties exist. For the rest of this paper, C is an arbitrary 
but fixed ontology language, with axioms 21, ontologies O, 
consequences £, and interpretations 3. 

As an example language we use the DL EC (|Baader et 


al. 2005), which we briefly introduce here. Given two dis¬ 


joint sets Nc and Nr, EC concepts are built by the gram¬ 
mar rule C ::= A | T | C n C | 3 r.C where A £ Nc 
and r € Nr. EC, axioms and consequences are expressions 
of the form COD, where C and D are concepts. An 
interpretation is a pair (A 1 , x ) where A 1 is a non-empty 
set and x maps every As Nc to A 1 C A 1 and every 
r £ Nr to r 1 C A 1 x A 1 . This function is extended 
to concepts by T 1 := A 1 , (C H D) x := C 1 P\D X , and 
( 3r.C) x := {d | 3e £ C x .(d,e) £ r 1 }. This interpretation 
entails the axiom (or consequence) C A D iff C 1 C D 1 . 

The Bayesian ontology language BC extends C by associ¬ 
ating each axiom in an ontology with a context, which intu¬ 
itively describes the situation in which the axiom is required 
to hold. The knowledge of which context applies is uncer¬ 
tain, and expressed through a Bayesian network (Ceylan and 
|Penaloza 2014a| l. 

Briefly, a Bayesian network ( |Darwiche~ 2 009) is a pair 
B = (G , <l>j, where G = ( V., E) is a finite directed acyclic 
graph (DAG) whose nodes represent Boolean random vari¬ 
ables, and $ contains, for every x £ V, a conditional proba¬ 
bility distribution Pb(x \ n(x)) of x given its parents n(x). 
Every variable x £ V is conditionally independent of its 
non-descendants given its parents. Thus, the BN B defines a 
unique joint probability distribution (JPD) over V: 

Pb(V) = Pb{x | 7r(a;)). 

x&V 


Let V be a finite set of variables. A V-context is a con¬ 
sistent set of literals over V. A V-axiom is an expression 
of the form (a : n) where a £ 21 is an axiom and k is a 
L-context. A V-ontology is a finite set 0 of L-axioms, such 
that {a | (a : n) £ 0} £ 0. A BC knowledge base (KB) 
over V is a pair K, = (0, B) where B is a BN over V and 0 
is a L-ontology. 

We briefly illustrate these notions over the language BEC, 
an extension of the DL EC, in the following Example. 
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Figure 1: The BN B\ over the variables V = {x, y, z\ 



Example 1. Consider the BEC KB /C| = (Si, Oi) where 
0i={ (Comp C 3use.Memf1 3use.CPU : 0) , 

(3use.FailMem C FailComp : {x}) , 

(3use.FailCPU C FailComp : {x}) , 

(3use.FailMem n 3use.FailCPU C FailComp:{-ia;}) , 
(Mem C FailMem : {y}) , (CPU C FailCPU : {z})}, 
and B\ is the BN shown in Figure [T] 

This KB represents a computer failure scenario, where x 
stands for a critical situation, y represents the memory fail¬ 
ing, and z the CPU failing. 

The contextual semantics is defined by extending in¬ 
terpretations to evaluate also the variables from V. A 
V-interpretation is a pair I = (I, V 1 ) where I £ 3 and 
V 1 is a propositional interpretation over the variables V. 
The U-interpretation I = (I , V 1 ) is a model of (a : k) 
(I |= (a : k)), where a £ 21, iff (V 1 ft)0or (I |= a). 

It is a model of the V -ontology 0 iff it is a model of all 
the V'-axioms in 0. It entails c £ £ if X |= c. The intuition 
behind this semantics is that an axiom is evaluated to true by 
all models provided it is in the right context. 

Given a U-ontology 0, every propositional inter¬ 
pretation, or world, W on V defines an ontology 
Ow := {a | (a : n) £ 0, W (= p k}. Consider the KB K ,i 
provided in Example [lj The world W = {x,^y,z} defines 
the ontology 

Oyy={C°mp O 3use.Mem n 3use.CPU, 

3use.FailMem C FailComp, 

3use.FailCPU C FailComp, CPU C FailCPU}. 

Intuitively, a contextual ontology is a compact representa¬ 
tion of exponentially many ontologies from C\ one for each 
world W. The uncertainty in BC is expressed by the BN B, 
which is interpreted using multiple world semantics. 
Definition 2 (probabilistic interpretation). A probabilistic 
interpretation is a pair V = (I, Pi), where I is a set of 
U-interpretations and Pi is a probability distribution over 
I such that Pi (I) > 0 only for finitely many interpretations 
I £ I. It is a model of the U-ontology 0 if every I £ I is 
a model of 0. V is consistent with the BN B if for every 
valuation W of the variables in V it holds that 

E W = P b( W )- 

iei, v ! =w 

2 We use \= p to distinguish propositional entailment from |=. 


























The probabilistic interpretation V is a model of the KB 
(B, 0) iff it is a model of 0 and consistent with B. 

The fundamental reasoning task in BC, probabilistic en- 
tailment, consists in finding the probability of observing a 
consequence c; that is, the probability of being at a context 
where c holds. 


Definition 3 (probabilistic entailment). Let c £ £, and K. 
a BC KB. The probability of c w.r.t. the probabilistic inter¬ 
pretation V = (I, Pi) is P v (c) := E(i,w)ei,z|=c w )- 
The probability of c w.r.t. K, is Pjc{c) := inf-p^ P-p(c). 

It has been shown that to compute the conditional proba¬ 
bility of a consequence c, it suffices to test, for each world 
VV, whether Oyy entails c ([Ceylan and Penaloza 2014b). 

Proposition 4. Let K. = (B. 0) be a BC KB and c G C. 
Then P K {c) = ^ 0w]=c P B (W). 


This means that reasoning in BC can be reduced to expo¬ 
nentially many entailment tests in the classical language C. 
For some logics, this exponential enumeration of worlds can 
be avoided ( Ceylan and Penaloza 2014c} . However, this de¬ 
pends on the properties of the ontological language and its 
entailment relation, and cannot be guaranteed for arbitrary 
languages. 

Another relevant problem is to compute the prob¬ 
ability of a consequence given some partial informa¬ 
tion about the context. Given a context k, the con¬ 
ditional probability Pjc(c \ n) is defined via the rule 

Pk(c ,«) = Pk(c | k)Pb(k), where 


Pk(c,k)= P ^ W )- 

Oyy |=C,W|=pK 


For simplicity, in the rest of this paper we consider only un¬ 
conditional consequences. However, it should be noted that 
all results can be transferred to the conditional case. 

Example 5. Consider again the KB K. i = (B \ . Oj) from 
Example Tjand the consequence Comp C FailComp. We are 
interested in finding the probability of the computer to fail, 
i.e. P^^Comp C FailComp). This can be computed by enu¬ 
merating all worlds W for which Oyy |= Comp C FailComp, 
which yields the probability 0.238. 

As seen, it is possible to extend any ontological language 
to allow for probabilistic reasoning based on a Bayesian net¬ 
work. We now further extend this formalism to be able to 
cope with controlled updates of the probabilities over time. 


Dynamic Bayesian Ontology Languages 

With BC, one is able to represent and reason about the uncer¬ 
tainty of the current context, and the consequences that fol¬ 
low from it. In that setting, the joint probability distribution 
of the contexts, expressed by the BN, is known and fixed. In 
some applications, see especially ( jSadilek and Kautz 2010] ) 
this probability distribution may change over time. For ex¬ 
ample, as the components of a computer age, their probabil¬ 
ity of failing increases. The new probability depends on how 
likely it was for the component to fail previously, and the 
ageing factors to which it is exposed. We now extend BC to 



Figure 2: The TBN B- ¥ over the variables V = {x. y. z} 


handle these cases, by considering dynamic BNs as the un¬ 
derlying formalism for managing uncertainty over contexts. 
Dynamic BNs (DBNs) (jDean and Kanazawa 1989 Mur- 


[phy 2002] l extend BNs to provide a compact representa¬ 
tion of evolving joint probability distributions for a fixed set 
of random variables. The update of the JPD is expressed 
through a two-slice BN, which expresses the probabilities at 
the next point in time, given the current context. 


Definition 6 (DBN). Let V be a finite set of Boolean ran¬ 
dom variables. A two-slice BN (TBN) over V is a pair 
(G,<1>), where G = (V U V',E) is a DAG containing 
no edges between elements of V, V' = { x 1 \ x £ V}, 
and $ contains, for every x’ £ V' a conditional prob¬ 
ability distribution P{x 1 \ tt{x')) of x' given its parents 
7r( x'). A dynamic Bayesian network (DBN) over V is a pair 
V = (0i, where B\ is a BN over V, and B is a TBN 
over V. 


A TBN over V = {x, y, z ) is depicted in Figure [2] The 
set of nodes of the graph can be thought of as containing two 
disjoint copies of the random variables in V. Then, the prob¬ 
ability distribution at time t + 1 depends on the distribution 
at time t. In the following we will use Vt and x t for x £ V, 
to denote the variables in V at time t. 

As standard in BNs, the graph structure of a TBN encodes 
the conditional dependencies among the nodes: every node 
is independent of all its non-descendants given its parents. 
Thus, for a TBN B, the conditional probability distribution 
at time t + 1 given time t is 


PB(v t+ 1 1 v t )= n p s& i < x '))- 

x'ev' 


We further assume the Markov property: the probability 
of the future state is independent from the past, given the 
present state. 

In addition to the TBN, a DBN contains a BN £>i that en¬ 
codes the JPD of V at the beginning of the evolution. Thus, 
the DBN T> = (B\, B^ ) defines, for every t > 1, the unique 
probability distribution 


PbW) = P Bl (Vi) nn Pt3^{Xi | 7r(Xi)). 

i—2 xGV 


Intuitively, the distribution at time t is defined by unraveling 
the DBN starting from B\, using the two-slice structure of 
£L r until t copies of V have been created. This produces a 






























Figure 3: Three step unraveling B\ ,3 of (B\. B^) 


new BN £> 1:i encoding the distribution over time of the dif¬ 
ferent variables. Figure [3] depicts the unraveling to t = 3 
of the DBN ( B \, P_,) where B\ and B-, are the networks 
depicted in Figures [T] and [2] respectively. The conditional 
probability tables of each node given its parents (not de¬ 
picted) are those of B\ for the nodes in Vi, and of £>_> for 
nodes in V 2 U V :i . Notice that B\ :t has t copies of each ran¬ 
dom variable in V. For a given / > 1, we call B f the BN 
obtained from the unraveling B\-t of the DBN to time t, and 
eliminating all variables not in V t . In particular, we have that 

PbAV) = 

The dynamic Bayesian ontology language VBC is very 
similar to BC , except that the probability distribution of the 
contexts evolves accordingly to a DBN. 

Definition 7 ( VBC KB). A VBC knowledge base (KB) is a 
pair /C = (T>, 0) where V = (B\, B^) is a DBN over V and 
0 is a V-ontology. Let 1C = {V, 0) be a VBC KB over V. 
A timed probabilistic interpretation is an infinite sequence 
(p = (Vt)t> 1 of probabilistic interpretations. 'P is a model 
of K. if for every t, Vt is a model of the BC KB (B f , 0). 

In a nutshell, a DBN can be thought of as a compact rep¬ 
resentation of an infinite sequence of BNs £>i,£> 2 , ... over 
V. Following this idea, a VBC KB expresses an infinite 
sequence of BC KBs, where the ontological component re¬ 
mains unchanged, and only the probability distribution of 
the contexts evolves over time. A timed probabilistic inter¬ 
pretation p simply interprets each of these BC KBs, at the 
corresponding point in time. To be a model of a VBC KB, 
P must then be a model of all the associated BC KBs. 

Before describing the reasoning tasks for VBC and meth¬ 
ods for solving them, we show how the computation of all 
the contexts that entail a consequence can be reduced to 
the enumeration of the worlds satisfying a propositional for¬ 
mula. 


Compiling Contextual Knowledge 

From Proposition [4] we see that reasoning in BC can be re¬ 
duced to checking, for every world W, whether Oyy (= c. 
This reduces probabilistic reasoning to a sequence of stan¬ 
dard entailment tests over the original language C. However, 
each of these entailments might be very expensive. For ex¬ 
ample, in the very expressive DL STLOIQ , deciding an en¬ 
tailment is already NExpTlME-hard (Tobies 2000 1 . Rather 
than repeating this reasoning step for every world, it makes 
sense to try to identify the relevant worlds a priori. We do 
this through the computation of a context formula. 


Definition 8 (context formula). Let 0 be a L-ontology, and 
c £ €. A context formula for c w.r.t. 0 is a propositional for¬ 
mula cf> such that for every interpretation W of the variables 
in V, it holds that Oyy (= c iff W \= p </>. 


The idea behind this formula is that, for finding whether 
Oyy |= c, it suffices to check whether the valuation W satis¬ 
fies </>. This test requires only linear time on the length of the 
context formula. The context formula can be seen as a gen¬ 
eralization of the pinpointing formula (Baader and Penaloza 


2010b I and the boundary (Baader et al. 2012), defined orig¬ 


inally for classical ontology languages. 


Example 9. Consider again the L-ontology Oi from Exam¬ 
ple^ The formula := (x A (y V z)) V (~<x A y A z) is a 
context formula for Comp C FailComp w.r.t. Oi. In fact, the 
valuation {x, ~xy, z} satisfies this formula. 


Clearly, computing the context formula must be at least 
as hard as deciding an entailment in C: if we label every 
axiom in a classical £-ontology O with the same proposi¬ 
tional variable x , then the boundary formula of c w.r.t. this 
{x}-ontology is x iff O \= c. On the other hand, the al¬ 
gorithm used for deciding the entailment relation can usu¬ 
ally be modified to compute the context formula. Using 
arguments similar to those developed for axiom pinpoint¬ 
ing ( |Baader and Penaloza 201Oaj Kalyanpur et al. 2007) ), 
it can be shown that for most description logics, comput¬ 
ing the context formula is not harder, in terms of computa¬ 
tional complexity, than standard reasoning. In particular this 
holds for any arbitrary ontology language whose entailment 
relation is ExpTlME-hard. This formula can also be com¬ 
piled into a more efficient data structure like binary decision 
diagrams ( |Lee 1959j >. Intuitively, this means that we can 
compute this formula using the same amount of resources 
needed for only one entailment test, and then use it for ver¬ 
ifying whether the sub-ontology defined by a world entails 
the consequence in an efficient way. 


Reasoning in VBC 

Rather than merely computing the probability of currently 
observing a consequence, we are interested in computing 
the probability of a consequence to follow after some fixed 
number of time steps t. 

Definition 10 (probabilistic entailment with time). Let 
1C = ( V , 0) be a VBC KB and c £ £. Given a timed in¬ 
terpretation ^3 and t > 1, the probability of c at time t w.r.t. 
*P is P<p(c[f]) := Pp t (c). The probability ofc at time t w.r.t. 
K. is P/c(c[f]) := inLp h/c P<p(c[f]). 

We show that probabilistic entailment over a fixed time 
bound can be reduced to probabilistic entailment defined for 
BOLs by unravelling the DBN. 

Lemma 11. Let K, = ( V , 0) be a VBC KB, c £ €, and 
t > 1. Then the probability of c at time t w.r.t. K. is given by 

Pic(c[t))= £ P Bt (W) 

O w |= c 

Proof. (Sketch) A timed model of K, is a sequence of 
probabilistic interpretations V\,V-z, ■ ■ ■, where each V, is a 
























model of the BC KB /Cj := {Bi, 0). We use this fact to 
show that 


p K.{c[t}) 


inf Pp(c[f]) = 

inf P Vt (c) 

(1) 

tp|=/c 

4? NC 


inf Pp t (c) = 

P K t ( c ) 

(2) 

PtMCt 


E 


(3) 


Owl =c 


where 0 follows from Definition [TO] ( | 2 | holds by defini¬ 
tion, and 0 follows from Proposition^] □ 

Lemma[TT|provides a method for computing the probabil¬ 
ity of an entailment at a fixed time t. One can first generate 
the BN Bt, and then compute the probability w.r.t. Bt of all 
the worlds that entail c. Moreover, using a context formula 
we can compile away the ontology and reduce reasoning to 
standard inferences in BNs, only. 

Theorem 12. Let K, = [V, 0) be a VBC KB, c £ i t, f a 
context formula for c w.r.t. 0, and t > 1. Then the probabil¬ 
ity of c at time t w.r.t. JC is given by Pjc(c[t]) = Pg t (</>). 


Proof. By Lemma o and the definition of a context for¬ 
mula, we have 


p>dc[t})= E p B t m= E PB t m = PeM, 

Owl=c W |= p <p 


which proves the result. 


□ 


This means that one can first compute a context formula 
for c and then do probabilistic inferences over the DBN to 
detect the probability of satisfying (!) at time l. For this, we 
can exploit any existing DBN inference method. One option 
is to do variable elimination over the i-step unraveled DBN 
Bi-t, to compute Bt- Assuming that t is fixed, it suffices 
to make 21 v inferences (one for each world) over Bt and 
the same number of propositional entailment tests over the 
context formula. If entailment in £ is already exponential, 
then computing the probability of c at time t is as hard as 
deciding entailments. 

The previous argument only works assuming a fixed time 
point t. Since it depends heavily on computing Bt (e.g., via 
variable elimination), it does not scale well as t increases. 
Other methods have been proposed for exploiting the recur¬ 
sive structure of the DBN. For instance, one can use the al¬ 
gorithm described in (Vlasselaer et al. 2014) that provides 
linear scalability over time. The main idea is to compile the 
structure into an arithmetic circuit ( jDarwiche 2009) and use 
forward and backward message passing ( Murphy 2002) . 

While computing the probability of a consequence at a 
fixed point in time f is a relevant task, it is usually more im¬ 
portant to know whether the consequence can be observed 
within a given time limit. In our computer example, we 
would be interested in finding the probability of the system 
failing within, say, the following twenty steps. 

Abusing of the notation, we use the expression ~ c, c £ C, 
to denote that the consequence c does not hold; i.e., I |= ~ c 
iff I Y= c. Thus, for example, Pjc(~ c[f]) is the probability 


of c not holding at time t. To find the probability of observ¬ 
ing c in the first t time steps, one can alternatively compute 
the probability of not observing c in any of those steps. For¬ 
mally, for a timed interpretation ip and f £ N, we define 

Py(c[l ■ f]) := 1 - P<p(~c[l], ... ,~c[f]). 

Definition 13 (time bounded probabilistic entailment). The 
probability of observing c in at most t steps w.r.t. the VBC 
KB K. is P K (c[ 1 : t]) := inf^ Pp(c[ 1 : t}). 

Just as before, given a constant t > 1, it is possible to 
compute P K {c[\ : t]) by looking at the t-step unraveling 
of V. More precisely, to compute P<p(~ c[l],..., ~ c[t]), it 
suffices to look at all the valuations W of ]J * =1 V) such that 
for ali i, 1 < i < t, it holds that 0 W (,\ |= ~ c. These val¬ 
uations correspond to an evolution of the system where the 
consequence c is not observed in the first t steps. The prob¬ 
ability of these valuations w.r.t. B\-t then yields the proba¬ 
bility of not observing this consequence. We thus get the 
following result. 

Theorem 14. Let K, = (V, 0) be a VBC KB, c £ £, 
f a context formula for c w.r.t. 0, and t > 1. Then 
P/c(c[l : f]) = X)w| 3 i.VV(i)|= p 0 -^ > Si:t(W). 

Proof (Sketch). Using the pithy interpretations of the crisp 
ontologies Oyv(i), we can build a timed interpretation *Po 

such that Pp 0 (c[l : t}) = Ew| 3 t wfflh„ 4 > p Bi !t (W), in 

a way similar to Theorem 12 of (Ceylan and Peiialoza 
2014b| ). The existence of another timed interpretation \p 
such that Prp(c[l : t]) < P<p 0 (c[l : t]) contradicts the 
properties of the pithy interpretations. Thus, we obtain that 

JWc[l : *]) = in %|=A:Pp(c[l : t}) = P/c(c[l : t]). □ 

This means that the probability of observing a conse¬ 
quence within a fixed time-bound t can be computed by 
simply computing the context formula and then performing 
probabilistic a posteriori computations over the unraveled 
BN. In our running example, the probability of observing a 
computer failure in the next 20 steps is simply 

P;e(Comp C FailComp[l : 20]) = E 

W\3i.W(i)\= p 4> 

Thus, the computational complexity of reasoning is not af¬ 
fected by introducing the dynamic evolution of the BN, as 
long as the time bound is constant. Notice, however, that 
the number of possible valuations grows exponentially on 
the time bound t. Thus, for large intervals, this approach 
becomes unfeasible. 

By extending the time limit indefinitely, we can also 
find the probability of eventually observing the consequence 
c (e.g., the probability of the system ever failing). The 
probability of eventually observing c w.r.t. K. is given by 
P/c(c[oo]) := lim^oo P K (c[l : t\). Notice that P K (c[l : t]) 
is monotonic on t and bounded by 1 ; hence Pjc(c[oo]) is 
well defined. 

Observe that Theorem [14] cannot be used to compute the 
probability of eventually observing c since one cannot neces¬ 
sarily predict the changes in probabilities of finding worlds 
that entail the consequence c. Rather than considering these 
















increasingly large BNs separately, we can exploit methods 
developed for probability distributions that evolve over time. 
This will also allow us to extract more information from 
VBC KBs. 

It is easy to see that every TBN defines a time- 
homogeneous Markov chain over a finite state space. More 
precisely, if B is a TBN over V, then _A/fg is the Markov 
chain, where every valuation W of the variables in V is a 
state and the transition probability distribution given the cur¬ 
rent state W is described by the BN obtained from adding VV 
as evidence to the first slice of B. For example, the TBN £>_, 
from Figure[2]yields the conditional probability distribution 
given that {x : y , z } was observed at time t depicted in Fig¬ 
ure]?] From this, we can derive the probability of observing 
{; x , y, z} at time t + 1 given that it was observed at time t, 
which is P({x, y, z} t + 1 \{x,y,z} t ) = 0.252. 

We extend the notions from Markov chains to TBNs in 
the obvious way. In particular, the TBN B is irreducible 
if for every two worlds V, W, the probability of eventually 
reaching VV given V is greater than 0. It is aperiodic if for 
every world VV there is an nyy such that for all n > nw, 
it holds that P(W n | VV) > 0. A distribution P\y over the 
worlds is stationary if P(V \ W)Pw{ W) = P\v(V) 
holds for every world V. It follows immediately that if B 
is irreducible and aperiodic, then it has a unique stationary 
distribution ( [Harris 1956] ). 

Given a TBN B over V, let now Ag be the set of all sta¬ 
tionary distributions over the worlds of V. For a world W, 
define 5b(W) := minp e A B P( W) to be the smallest prob¬ 
ability assigned by any stationary distribution of B to VV. If 
(5 b(W) > 0, then we know that, regardless of the initial dis¬ 
tribution, in the limit we will always be able to observe the 
world VV with a constant positive probability. In particular, 
this means that the probability of eventually observing W 
equals 1. Notice moreover that this results is independent of 
the initial distribution used. 

We can take this idea one step forward, and consider sets 
of worlds. For a propositional formula <f>, let 

: = P ( W )- 

In other words, 5b (</>) expresses the minimum probability of 
satisfying <f> in any stationary distribution of B. From the 
arguments above, we obtain the following theorem. 

Theorem 15. Let K, = ( V , 0) be a VBC KB over V with 


V = ( Bi,B _►), c € C, and f a context formula for c w.r.t. 
0. If 5 b > 0, then P^(c[oo]) = 1. 

In particular, if B- ¥ is irreducible and aperiodic, Ag con¬ 
tains only one stationary distribution, which simplifies the 
computation of the function 5. Unfortunately, such a sim¬ 
ple characterization of P^(c[oo]) cannot be given when 
( 0 ) = 0. In fact, in this case the result may depend 
strongly on the initial distribution. 

Example 16. Let V = {cc}, O 2 = {{A C B : { 1 })}, and 
consider the TBN B'_ over V defined by P(x' | x) = 1 
and P(x! \ -ix) = 0. It is easy to see that any distribu¬ 
tion over the valuations of V is stationary. For every initial 
distribution B, if K. = (V, O 2 ) where V = (B,B'_^), then 
Pk(A E B[o o]) = P B O). 

So far, our reasoning services have focused on predicting 
the outcome at future time steps, given the current knowl¬ 
edge of the system. Based on our model of evolving prob¬ 
abilities, the distribution at any time t + 1 depends only on 
time t, if it is known. However, for many applications it 
makes sense to consider evidence that is observed through¬ 
out several time steps. For instance, in our computer fail¬ 
ure scenario, the DBN £> _, ensures that, if at some point a 
critical situation is observed (x is true), then the probability 
of observing a memory or CPU failure in the next step is 
higher. That is, the evolution of the probability distribution 
is affected by the observed value of the variable x. 

Suppose that we have observed over the first t time steps 
that no critical situation has occurred, and we want to know 
the probability of a computer failure. Formally, let E be a 
consistent set of literals over jj* =1 1 f. We want to compute 
the probability Pjc(c[i] | E ) of observing c at time t given 
the evidence E. This is just a special case of bounded prob¬ 
abilistic entailment, where the worlds are not only restricted 
w.r.t. the context formula but also w.r.t. the evidence E. 

The efficiency of this approach depends strongly on the 
time bound t, but also on the structure of the TBN Re¬ 
call that the complexity of reasoning in a BN depends on 
the tree-width of its underlying DAG ( |Pan et al. 1998j >. The 
unraveling of S_>. produces a new DAG whose tree-width 
might increase with each unraveling step, thus impacting the 
reasoning methods negatively. 

Conclusions 

We have introduced a general approach for extending on¬ 
tology languages to handle time-evolving probabilities with 
the help of a DBN. Our framework can be instantiated to 
any language with a monotonic entailment relation includ¬ 
ing, but not limited to, all the members of the description 
logic family of knowledge representation formalisms. 

Our approach extends on ideas originally introduced for 
static probabilistic reasoning. The essence of the method 
is to divide an ontology into different contexts, which are 
identified by a consistent set of propositional variables from 
a previously chosen finite set of variables V. The probabilis¬ 
tic knowledge is expressed through a probability distribution 
over the valuations of V which is encoded by a DBN. 

Interestingly, our formalism allows for reasoning meth¬ 
ods that exploit the properties of both, the ontological, and 














the probabilistic components. From the ontological point of 
view, we can use suplemental reasoning to produce a context 
formula that encodes all the possible worlds from which a 
wanted consequence can be derived. We can then use stan¬ 
dard DBN methods to compute the probability of satisfying 
this formula. 

This work represents first steps towards the development 
of a formalism combining well-known ontology languages 
with time-evolving probabilities. First of all, we have intro¬ 
duced only the most fundamental reasoning tasks. It is pos¬ 
sible to think of many other problems like finding the most 
plausible explanation for an observed event, or computing 
the expected time until a consequence is derived, among 
many others. 

Finally, the current methods developed for handling 
DBNs, although effective, are not adequate for our prob¬ 
lems. To find out the probability of satisfying the context 
formula <f>, we need compute the probability of each of the 
valuations that satisfy <!> at different points in time. Even us¬ 
ing methods that exploit the structure of the DBN directly, 
the information of the context formula is not considered. 
Additionally, with the most efficient methods to-date, it is 
unclear how to handle the evidence over time effectively. 
Dealing with these, and other related problems, will be the 
main focus of our future work. 

Acknowledgments 

ismail ilkan Ceylan is supported by DFG within the 
Research Training Group “RoSI” (GRK 1907). Rafael 
Penaloza was partially supported by DFG through the Clus¬ 
ter of Excellence ‘cfAED,’ while he was still affiliated with 
TU Dresden and the Center for Advancing Electronics Dres¬ 
den, Germany. 

References 

[Baader and Penaloza 2010a] Franz Baader and Rafael 
Penaloza. Automata-based Axiom Pinpointing. J. of 
Automated Reasoning, 45(2):91-129, 2010. 

[Baader and Penaloza 2010b] Franz Baader and Rafael 
Penaloza. Axiom Pinpointing in General Tableaux. J. of 
Logic and Computation, 20( 1):5—34, 2010. 

[Baader et al. 2005] Franz Baader, Sebastian Brandt, and 
Carsten Lutz. Pushing the EC envelope. In Proc. of 
IJCAI’05, pages 364-369. Morgan Kaufmann Publishers, 
2005. 

[Baader et al. 2007] Franz Baader, Diego Calvanese, Deb¬ 
orah L McGuinness, Daniele Nardi, and Peter F Patel- 
Schneider, editors. The Description Logic Handbook: The¬ 
ory, Implementation, and Applications. Cambridge Univer¬ 
sity Press, 2nd edition, 2007. 

[Baader et al. 2012] Franz Baader, Martin Knechtel, and 
Rafael Penaloza. Context-Dependent Views to Axioms and 
Consequences of Semantic Web Ontologies. J. of Web Se¬ 
mantics, 12—13:22—40, 2012. 

[Ceylan and Penaloza 2014a] ismail ilkan Ceylan and Rafael 
Penaloza. Bayesian Description Logics. In Proc. ofDL’14, 


volume 1193 of CEUR Workshop Proceedings. CEUR-WS, 
2014. 

[Ceylan and Penaloza 2014b] ismail ilkan Ceylan and 
Rafael Penaloza. The Bayesian Description Logic BEC. In 
Proc. of IJCAR’14, volume 8562 of LNCS, pages 480-494. 
Springer Verlag, 2014. 

[Ceylan and Penaloza 2014c] ismail ilkan Ceylan and Rafael 
Penaloza. Tight Complexity Bounds for Reasoning in the 
Description Logic BEC. In Proc. of TELIA' 14, volume 8761 
of LNCS, pages 77-91. Springer Verlag, 2014. 

[Darwiche 2009] Adnan Darwiche. Modeling and Reason¬ 
ing with Bayesian Networks. Cambridge University Press, 
2009. 

[Dean and Kanazawa 1989] Thoma Dean and Keiji 
Kanazawa. A model for reasoning about persistence 
and causation. Computational intelligence, pages 142-150, 
1989. 

[Harris 1956] Theodore Edward Harris. The existence of sta¬ 
tionary measures for certain Markov processes. In Proc. of 
Berkeley Symposium on Mathematical Statistics and Proba¬ 
bility, volume 2. University of California Press, 1956. 

[Jaeger 1994] Manfred Jaeger. Probabilistic Reasoning in 
Terminological Logics. In Proc. of KR’94, pages 305-316. 
AAAI Press, 1994. 

[Jaeger 1997] Manfred Jaeger. Relational Bayesian Net¬ 
works. In Proc. ofUAI’97, pages 266-273. Morgan Kauf¬ 
mann Publishers, 1997. 

[Kalyanpur et al. 2007] Aditya Kalyanpur, Bijan Parsia, 
Matthew Horridge, and Evren Sirin. Finding All Justifica¬ 
tions of OWL DL Entailments. In Proc. ofISWC’07, volume 
4825 of LNCS, pages 267-280. Springer Verlag, 2007. 

[Klinov and Parsia 2011] Pavel Klinov and Bijan Parsia. 
Representing sampling distributions in P-SROIQ. In Proc. 
of URSW’l 1, volume 778 of CEUR Workshop Proceedings. 
CEUR-WS, 2011. 

[Lee 1959] Chang-Yeong Lee. Representation of Switching 
Circuits by Binary-Decision Programs. Bell System Techni¬ 
cal Journal, 38(4), 1959. 

[Lukasiewicz and Straccia 2008] Thomas Lukasiewicz and 
Umberto Straccia. Managing uncertainty and vagueness 
in description logics for the Semantic Web. Web Seman¬ 
tics: Science, Services and Agents on the World Wide Web, 
6(4):291-308, November 2008. 

[Lutz and Schroder 2010] Carsten Lutz and Lutz Schroder. 
Probabilistic Description Logics for Subjective Uncertainty. 
In Proc. ofKR’10, pages 393-403. AAAI Press, 2010. 

[Murphy 2002] Kevin Patrick Murphy. Dynamic bayesian 
networks: representation, inference and learning. PhD the¬ 
sis, University of California, Berkeley, 2002. 

[Pan et al. 1998] Heping Pan, Daniel McMichael, and Marta 
Lendjel. Inference Algorithms in Bayesian Networks and 
the Probanet System. Digital Signal Processing, 8f4):231— 
243, 1998. 

[Sadilek and Kautz 2010] Adam Sadilek and Henry Kautz. 
Recognizing multi-agent activities from GPS data. In Proc. 
of AAAI'10, pages 1134-1139, 2010. 



[Tobies 2000] Stephan Tobies. The Complexity of Reason¬ 
ing with Cardinality Restrictions and Nominals in Expres¬ 
sive Description Logics. J. of Artificial Intel. Research, 
12:199-217,2000. 

[Vlasselaer et al. 2014] Jonas Vlasselaer, Wannes Meert, 
Guy Van Den Broeck, and Luc De Raedt. Efficient Prob¬ 
abilistic Inference for Dynamic Relational Models. In Proc. 
of StarAI’ 14, volume WS-14-13 of AAAI Workshops. AAAI 
Press, 2014. 



